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6 CMP Polishing Pads 

David B. James 



6.1 Introduction 

Although polishing is an old technology [1, 2, 3], it has become an enabling 
process for the manufacture of leading-edge semiconductor devices. In the 
manufacture of such devices, polishing is used to maintain planarity at each 
step in the process of depositing and photohthographically imaging sequential 
insulating dielectric and conductive metal layers. Also as noted in the first 
chapter, CMP is now also employed to remove a bulk film and then stop, 
as in damascene copper and tungsten poHshing. As semiconductor devices 
become increasingly complex, requiring finer feature geometries and more 
metalhzation layers, greater demands are placed on the polishing consumables 
used in the CMP process to manufacture such devices [4, 5, 6]. 

Pohshing consumables usually comprise a polymeric polishing pad used in 
conjunction with an aqueous based poHshing slurry [7]. Conventionally, the 
slurry contains abrasive particles but, also recently, two variants on the stan- 
dard consumables have been studied. In the first new approach, the abrasive 
particles have been incorporated into the pad and in the second, the pad is 
used with a particle-free reactive liquid. This chapter will focus on conven- 
tional polishing pads and, for completeness, will include only a brief section 
on slurryless pad technology. More specifically, this chapter will discuss poly- 
mer criteria for pohshing pads, types of pads available and their manufactvire, 
control of pad properties, and the relationship of those properties to polishing 
performance. 



6.2 Polymer Requirements for Polishing Pads 

Since polishing is both a mechanical and a chemical process, the polymeric 
pohshing pad must have sufficient mechanical integrity and chemical resis- 
tance to survive the rigors of poHshing. As discussed later in this chapter, 
important mechanical properties include high strength to resist tearing dur- 
ing polishing, acceptable levels of hardness and modulus selected based on 
the material being polished, and good abrasion resistance to prevent excessive 
pad wear during poHshing. 
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Chemically, the pad must be able to survive the aggressive slurry chem- 
istries used in CMP polishing without degrading, delaminating, blistering or 
warping. Slurry chemistries include either highly alkaline slurries near pH 11 
used for polishing of inter-layer dielectric oxide layers or highly acidic, oxi- 
dizing slurries used for polishing metal fikas, such as timgsten and copper. 
Slurries for metal CMP can have pH values less than 2 and contain oxidizing 
agents such as hydrogen peroxide, ferric nitrate or potassium iodate. Typi- 
cally, over the lifetime of a single pad, it is exposed to these chemistries for 
many horns and often several days, usually at temperatures above ambient. 

A third criterion is that the pads must be sufficiently hydrophilic. The 
aqueous-based abrasive containing slurry or the particle-free reactive liquid 
must wet the surface of the pad and form a liquid film between the wafer and 
the pad. If the liquid does not wet but instead beads on the pad surface, it 
will be swept away by the wafer edge and the interior of the wafer will be 
starved of the necessary chemistry to enable effective polishing. 

Pad hydrophihcity may be expressed in terms of the pad's Critical Surface 
Tension. This is defined as the wettability of a soUd surface by noting the 
lowest surface tension a liquid can have and still exhibit a contact angle 
greater than zero degrees on that solid. In terms of polishing, this means 
that pads made from polymers with higher critical surface tension values are 
more hydrophilic and slurry will more readily wet these pads. 

Table 6.1 shows critical surface tension values for several conunerciaUy 
available polymers [8]. 

Prom the Table 6.1, polymers with higher critical surface tension values 
correspond to those polymers which are useful as polishing pads. The mini- 
mum value is around 37 mN/m, and preferred values are in the mid 40's. This 
range includes polymers such as poly (methyl methacrylate), polycarbonates, 
nylons, polysulfones, and polyurethanes. 

As the critical surface tension of the pad decreases, polishing performance 
will also decrease. Thus one would expect polyethylene and PTFE pads to 
perform poorly because slurry does not spread over the pad surface. However, 
a number of approaches have been used to enable less hydrophilic polymers to 
be used as polishing pads. These include adding high levels of wetting agents 
to the slurry, adding hydrophilic fillers such as sihca to normally hydrophobic 
polymers, or by chemically modifying the hydrophobic polymer to make it 
more hydrophihc. Techniques used for the latter include plasma treatment, 
corona discharge or the chemical addition of polar groups to the pad surface. 
However, since pads are continually abraded during use, either by condition- 
ing prior to polishing or by the polishing process itself, the treatment needs 
to be effective through the whole cross-section of the pad and not just the 
surface layer. 

Since poHshing is a wet process, the mechanical properties of the pad must 
be essentially retained, even when the pad is wet. However, the properties 
of hydrophihc polymers which are preferred for polishing pads will change 
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Table 6.1. Critical Surface Tension Values of Polymers 



Polymer 


Critical Surface Tension (wN/m) 


Polytetrafluoroetliylene 


19 


Polydimethylsiloxane 


24 


Silicone Rubber 


24 


Polybutadiene 


31 


Polyethylene 


31 


Polystyrene 


33 


Polypropylene 


34 


Polyacrylamide 


35-40 


PolyT/^inyl alcohol 


37 


Polymethyl methacrylate 


39 


Polyvinyl chloride 


39 


Polysulfone 


41 


Nylon 6 


42 


Polycarbonate 


45 


Polyurethane 


45 



during exposure to aqueous polishing slurries. Typically water acts as a plas- 
ticizer which decreases pad modulus and hardness, and increases ductility. 
Thus under polishing conditions, the pads will become more ductile and flex- 
ible. These changes are reversible and pad properties revert to the original 
values on drying, indicating that chemical attack of the pad is minimal. 

The changes in pad properties during polishing are important for those 
researchers interested in modeUng the mechanisms of CMP polishing. Often 
dry properties rather than the more appropriate wet properties are used in 
the models. Secondly, most properties measured are bulk properties. Since 
polishing is an interfacial process, in many cases the properties of the pad 
surface rather than the bulk should be used. Surface properties change rapidly 
during immersion in slurry and quickly reach equilibrium values [61]. 

The final criterion for the polymer is that the polymer formulation and 
morphology can be varied to give pads with specific, predictable properties 
for different polishing apphcations. Thus a family of pads is preferred, such 
that performance may be fine-tuned for the specific polishing appHcation, 
polishing tool, wafers and slurry. 

The types of polymers which best satisfy the criteria discussed above 
are polyurethanes. Polyurethanes combine good mechanical properties with 
excellent chemical stabihty and, as shown in the next section, their proper- 
ties may be readily and precisely controlled. Furthermore, with polyurethane 
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technology it is possible to fabricate a wide range of pad microstmctures 
including foams, impregnated felts and solid pads, and to use a variety of 
polyurethane manufacturing processes including casting, molding, extrusion, 
web-coating and sintering. These will be discussed in more detail later in the 
chapter. 



6.3 Basics of Poly ur ethanes 

Many books have been written covering the chemistry, morphology, prop- 
erties and manufacture of polyurethanes, and the reader is referred to 
these [9, 10, 11] for in-depth discussions. This section will provide a brief 
overview of polyurethanes and cover those aspects most relevant to pohshing 
pad technology. 

6.3.1 Formulations 

Polyurethanes axe typically composed of at least three components: 

1. Long chain polyol 

2. Diiocyanate or isocyanate of higher functionality 

3. Chain Extender. 

Most polyols used in the manufacture of polyurethanes are polyethers with 
terminal hydroxyl groups. Hydroxyl-terminated polyesters are sometimes 
used to obtain polyurethanes with special properties. However, polyester di- 
ols tend to be more expensive and are less chemically stable than polyether 
diols, especially in basic solution. The choice of polyol, especially the size 
(molecular weight), flexibility of its molecular structure, and functionahty, 
has a large effect on the properties of the resultant polyurethane. 

A second method of varying the properties of a polyurethane is through 
the selection of the isocyanate. Several aromatic and aliphatic isocyanates 
are available, but 95% of all polyurethanes are based on either toluene diiso- 
cyanate (TDI) or diisocyanato-diphenyhnethane (MDI) and its derivatives. 
MDI is often preferred over TDI because of its lower toxicity and greater 
chemical flexibiUty. Although pure MDI is a difunctional soUd having two 
reactive isocyanate groups per molecule, producers of isocyanates have de- 
veloped liquid MDI variants of higher functionality than two. This enables 
control of crosslinking reactions and hence of the resultant polyurethane prop- 
erties. 

The third important component in a polyurethane formulation are small 
molecules known as chain extenders. These are chemicals containing groups 
which can react with isocyanates and link such isocyanates together to in- 
troduce specialized polymer segments into the polyurethane backbone. Ex- 
amples include molecules such as low molecular diols (e.g. ethylene glycol or 
butane diol), diamines, and water. 
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Fundamentally, there are three approaches to formulating polyurethanes, 
described by the urethane industry as "prepolymer" , "quasi" , and "one-shot" . 
In the "one-shot" approach all the components are mixed together and re- 
acted at one time. This can result in a highly exothermic reaction which is 
more difficult to control and can lead to reproducibihty problems. In the "pre- 
polymer" approach, the isocyanate is pre-reacted with the long chain diol to 
form a high molecular weight isocyanate terminated moiety. This can then be 
further reacted with diol or diamine curatives to complete the polyurethane 
forrtiation. The advantages of this approach are greater control over the chem- 
istry and a more consistent product. The "quasi" approach is intermediate 
between the two other approaches. 

Additional ingredients commonly included in commercial polyurethane 
formulations for polishing pads are catalysts, fillers and blowing agents. 

6.3.2 Chemistry 

Polyurethanes are addition polymers [12] formed by reaction of di- or poly- 
functional isocyanates with polyols: 

Polyol + Isocyanate = Polyurethane. 

However, urethane reactions are actually much more complex. This is 
a consequence of the high reactivity of isocyanates which can react with any 
molecules present containing active hydrogen groups, and by the fact that 
it is usual to use a stoichiometric excess of isocyanate with respect to diol. 
Thus, under suitable conditions, many secondary reactions are possible, such 
as: 

Isocyanate 4- Urethane = AUophanate 

2 X Isocyanate = Uretidinedione (Dimerization) 

3 X Isocyanate = Isocyanurate (Trimerization) 

Isocyanate + Water = Carbon Dioxide. 

These reactions can be preferentially controlled through the reaction con- 
ditions. Thus one reaction over another can be favored through the use of 
catalysts and by controUing the reaction temperature. The first three of the 
above reactions create chemical cross-links between chains and a network 
structure. As will be iUustrated later, as a consequence of the polyurethane 
cross-linking reactions, pad properties can be fine-tuned through control of 
the stoichiometric ratio of isocyanate to diol. 

6.3.3 Morphology 

Polyurethanes are multi-phase materials with complex morphologies [9, 10, 
11]. Their molecular structures vary from rigid cross-linked polymers to hn- 
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ear MeMv extensible elastomers. A common feature, however, of polyur- 
eS'ane? the presence of so-called "soft" and W segments. The type 
ald'ltive amom^t-of these in the polyurethane structure are major deter- 
minants m controUing polyurethane properties. ,„.VHfnrma 
Soft segments are quite mobile and are normally present m coxled forma- 
tion Che^cally, they comprise the big molecular -^^8 d>am id 
component of the formulation. The mobihty of molecular chams m the soft 
~t results iu moreased flexibility, toughness and impact resistance. Mo- 
S depends on the chemical nature and chain length of the soft segraent. 
ifX the soft segment should be amorphous and have a low glass tra*. 
temperature Pha^e separation increases with mcreasmg cham le^h 
Id decreLg polarity of the soft segment due to less tod -g--V-fl 
segment interaction. Preferred molecular weights are m the 1000 to 4000 
aTe At higher molecular weights, especially at low hard segment amounts, 
ttee is a tidency for the soft segments to crystallize which will reduce the 
elastomeric benefits conferred by the soft segments. 

"TegmeBts alternate with hard segments which are stiff — haae 
units, principaUy composed of reacted isocya^ate and 
eties Hard segments act as pseudo cross-links and control the dmensional 
them J rtabilfty of polyurethanes. Thus properties such as strength and s^ifE- 
at el^t J temperatures are controlled by the hard segments. Above 
a^Iin Cerature, the hard segments "melf and, iu the absence of 
ch^l cros^toks, the polyurethane becomes thermoplastic with greatly 
reduced strength and stiffness. . 

6.4 Types of Commercially Available Polishing Pads 
and Their Manufacture 

6.4.1 Types of Pads 

This section wiU cover the types of pad that axe coimnerdally available and 
currently used for CMP polishing. FoUowing the classification of Cook [IS], 
tS may be categorized into four types differentiated by their nncrostruc- 
ture. The types are: 

Type 1: Polymer Impregnated Felts 
Type 2: Poromerics (synthetic leathers) 
Type 3: Filled Polymer Sheets 
Type 4: Unfilled Textured Polymer Sheets. 

Table 6.2 summarizes the key features, properties, commercial trade 
names, and typical apphcations for the different pad types. 
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Table 6.2. Key Feattires, Properties and Applications for Different Pad Types 

T ype 1 Type 2 Type 3 Type 4 

Structure Felted fibers Porous film Microporous Non-porous 

impregnated coated on a polymer sheet polymer sheet 

with polymeric supporting "with surface 

binder substrate macrotexture 

Microstructure Continuous Vertically Closed ceU None 

channels oriented, open foam 

between fi bers pores 

Slurry loading Medium ffigh Low Minimal 

capacity 



Pad Examples PeUon™, Suba Politex™, ICIOOO™, OXP3000™, 
TM Surfin™, IGlOlO™, IC2000^^ 



URIOO™, IC1400™, 
WWPSOOO^'^ FX9™, MH™ 



Compressibility Medium ffi^ Low Y^llfl—-. 

Stiff ness Medium Low High Very High 

Hard ness Medium Low Hi^^ Very High 

Typical Si stock polish, Si final poUsh, Si stocky ILD ILD CMP, STI, 

AppUcations Tungsten CMP Tungsten CMP, CMP, STI, metal dual 

post-CMPbuff metal damascene 
damascene 
CMP 



Key US 4,728,552 3,100,721 -.5,578,362 5,489,233 

Patents 4,927,432 3,763,054 -5,900,164 6,022,268 

4,841,680 

6,099,954 

Note: Suba-^^, Politex-^^, URIOO™, WWP3000™, ICIOOO-^^, ICIOIO^*^ . 101400^^ , MH™, 
OXPSOOO^^, and 102000^^ are trade-names of Rodel Inc., Pellon^** and FX9'^'^ of Freuden- 
burg, and Surfin"^^ of Fujimi. 

6.4.2 Methods of Manufacture 

Although aU the four pad types are polyurethane based, each is manufac- 
tured by a different process, iUustrating the versatility of urethane chemistry. 
Table 6.3 provides a schematic outline of the manufacturing process for the 
different pad typesTTypes 1 and 2 pads are manufactured by a continuous 
roU or web process, Type 3 pads by a batch process, and Type 4 pads by 
either a batch process or by a unit operation, net shape process. Emergmg, 
alternative manufacturing processes will be discussed later in this section. 

Each manufacturing process has its own advantages and hmitations, and 
the preferred process is largely dictated by cost and pad properties required 
for the specific poUshing application. Each process also has its own set of 
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Table 6.3. Mantifacturing Processes for Different Pad Types 



Type 1 


Type 2 


Types 


Type 4 


Needle polyester 
fibers to form 
xLon- woven felt 


Prepare 
supporting 
substrate using 
modified Type 1 
process 


Mix polyurethane 

pX Co U.i oUi O OtlXKX 

pore forming 
agent 


Form polymer 
sheet 


1^* Urethane 
Impregnation 


Coat substrate 
with polyurethane 
solution 


Cast into mold 


Apply texture 
(e.g. grooves) 


2'''^ Urethane 
Impregnation 


Coagulate 
surface film 


Cure at elevated 

temperature 


Apply PSA 


Split Impregnated Buff to open pore 
Felt to required structure and 
thickness control thickness 


Skive cake into 
individual pad 
slices 


Laminate to 
Base Pad 


Buff 


Apply PSA 


Perforate or 
groove 




Apply PSA 




Apply PSA 





Laminate to 
Base Pad 



process variables which can be used to control pad properties. The effects 
of manufacturing process variables on pad properties have been recently dis- 
cussed by Cook [13] and the reader is referred to this reference for more 
information. 

Currently, Type 3 pads, because of their higher stiffness and resultant 
abihty to planarize versus Type 1 and 2 pads, are predominantly used for 
CMP poUshing of ILD, W, Cu and STL It is expected that Type 4 pads may 
emerge as the next generation pad of choice because of their potentially less 
complex manufacturing process, resulting in improved pad to pad consistency 
and more predictable polishing performance. Data iUustrating the benefits of 
Type 4 pads will be discussed in more depth later. 

The patent literature describes several novel approaches to manufacture 
pads for CMP poUshing. The principal driving forces are to: 

1. Simphfy manufacturing processes to ensure increased pad to pad consis- 
tency 

2. Eliminate skiving and machine grooving operation 

3. Develop net shape manufacturing process capable of producing patterned 
polishing pads having a wide range of physical properties 

4. Develop processes capable of linear pads for next generation pohshers 

Table 6.4 references some of the recently issued US patents. 
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Table 6.4. US Patents describing Alternative Manufacturing Processes 



Manufacturing Approach US Patents 



Sintering of Polymeric Powders 6,017,265, 6,062,968, 6,106,754, 



6,117,000, 6,126,532 



Photopolymerization of Liquid 5,958,794, 6,036,579 
Precursors 



Net-shape Molding 6,022,268 



Extrusion of thermo-formable 5,489,233, 6,022,268 

polymers 

Each of these approaches has strengths and weaknesses. The preferred 
process depends both on the type of material being pohshed and on the tool 
platform. 

6.4.3 Pad Microstructures 

Each type of pad has a unique microstructure, as shown in the following 
SEM photomicrographs. Commercially available poUshing pads are compos- 
ite materials with properties determined by both their microstructure and 
polyurethane formulation. 

Figures 6.1 and 6.2 show the cross-section and surface microstructures 
respectively of a Type 1 pad, as exemphfied by Suba 500^^. The microstruc- 
ture is characterized by non-woven polyester fibers, partially impregnated 
with polyurethane to leave open porosity throughout the pad. By controlling 




Fig. 6.1. Cross-section of a Type 1 Pad (Suba 500*^ ) 
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Fig. 6.2. Surface of a Type 1 Pad (Suba 500'^*^) 

the type of polyurethane and the amount of residual porosity, a family of 
pads is possible differing in compressibility, hardness and stiflEness. 

Figures 6.3 and 6.4 show the cross-section and surface microstructures re- 
spectively of a Type 2 pad, as exemplified by URIOO'^^. This type of pad has 
the most complex microstructure consisting of a porous layer on a supporting 
substrate similar in structure to a Type 1 pad. Type 2 pads axe among the 
earUest pads used for polishing and their origin dates back to the Corfam"^^ 
process developed by Dupont in the 1950's to make microporous, permeable 
artificial leather [14]. It is interesting to note that real leather was used during 
the early days of semiconductor technology to poHsh siUcon wafers. Type 2 




Fig. 6.3. Cross-section of a Type 2 Pad (URIOO"^^) 
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Fig. 6.4. Surface of a Type 2 Pad (URIOO'^^) 

pads are generically known as "poromerics" arising from the fact that these 
materials are porous and polymenc. The surface of the pad consists of open 
pores which can hold and transport slurry across the pad surface. 

Figures 6.5 and 6.6 show the cross-section and surface microstructures 
respectively of a Type 3 pad, as exemplified by ICIOOO'^^. This type of 
pad is essentially a closed cell foam, where the pores are created either by 
blowing agents or by the addition of micro-balloons. Since the pads are porous 
and made by mechanically machining individual pads, the pad siuface has 
significant texture even prior to conditioning. 




Fig. 6.5- Cross-section of a Type 3 Pad (ICIOOO™) 
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Fig. 6.7. Cross-section of a Type 4 Pad. (OXPSOOO'^^) 

Figures 6.7 and 6.8 show the cross-section ajid surface microstructures 
respectively of a Type 4 pad, a. exemplified by OXPSOOO^M . ^e^of pad 
has the simplest microstructure, being non-porous and mifiUed. With such 
Dads it is known [15] that it is essential to have both macro- and microtexture 
- to aiiieve acceptable polishing. As shown in Fig. 6.7, macrotexture consists of 
grooves formed in the pad surface either by mechanical machmmg as a post- 
processing step or by net-shape processing. 

Type 3 and Type 4 pads are usually used m combmation with an un- 
derlying base-pad. The function of the base-pad is to reduce polishing non- 
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Fig. 6.8. Surface of a Type 4 Pad (OXP3000™) 

uniformity across the wafer caused by non-planarity of the top-pad and pol- 
ishing tool deficiencies. Base-pads are typically of higher compressibility and 
lower stiffness than the top-pads and thus act essentially as supporting "cush- 
ions" for the top-pad. Figures 6.9 and 6.10 show cross-sections of two types 
of base-pads commonly used for CMP polishing. Figure 9 is an example of 
an impregnated polyester felt (SubalV'^^), similar to Type 1 pads discussed 
above but with higher compressibility and porosity. Figme 10 is an example 
of a polyurethane closed cell elastomeric foam. 




Fig. 6.9. Cross-section of a SUBA IV^^ Base Pad 



180 



David B. James 




6.5 Control of Polyurethane Pad Properties 

This section wiU explore approaches available to control the properties of 
polishing pads, -nie focus will be on l^pe 3 and Type 4 pads, since they are 
conunercially of most significance, but the approadies are also applicable to 
the other pad types. Approaches are: 

i. Control of hard and soft segments 

ii. Urethane stoichiometry 

iii. Pad thermal history 

iv. Amount of porosity. 

Each of these will be discussed in detail and illustrated with examples. 

6.5.1 Hard and Soft Segments 

Previously m this chapter, the concept of haxd and soft segments in polyure- 
thanes was discussed. The type and concentration of these segments are major 
actors in controlling pad properties. TypicaUy, increasing the sof^ segment 
c^centration increa.es toughness and flexibility but reduces modulus and 
haxdness. The hard segments, which usuaUy soften at temperatures above 
ambient, improve high temperature properties and increase properties such 

as stifl&iess and strength. ^ i ^ x^^^a ^r.A 

The abihty to manipulate pad properties through control of hard and 
soft segments is iUustrated in Figs. 6.11 and 6.12. These are ternary phase 
diagra^for athree component polyurethane '^^^^'-'"-^^f^^'^^^^^^ 
the triangle represents a specific composition. Figures 6.11 and 6.12 show 
hardness and elastic modulus respectively 
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Hardness 




Fig. 6.11. Hardness as a Function of Composition 




B!<-7.75 



Fig. 6.12. Modulus as a Function of Composition 



Figure 6.11 shows that it is possible to vary haxdness from Shore D values 
of less thaa 15 to greater than 65. This covers the practical rauge of interest 
for aU types of polishing pads. In general, harder pads are used for Planar^a- 
tion of oxide dielectric layers, shallow trench isolation, and tungsten plugs 
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and conductors. SUgMly softer pads are used for polishing copper damascene 
feat^d stiU sSter pads are used in final buff polishing to remove defects 

K*^:" i?sS a similar diagram for modulus. In fact, the diagram 
shows^he logarithm of modulus, agam illustrating the broad -^a^i^ P"P- 
erties achievfble through haxd aud soft sepnent control, to 6 12 modu^ns 
varies bv three orders of magnitude from rigid, stiff pads to pads that are very 
Sle'a^d elastomeric. As will discussed later, stiffer P^^s - pre^^^^^ 
those Dolishing appUcations where planarization is important. A comparison 
S en^d 6 12 shows that hardness and modulus foUow smular behav- 
^r S comrsitioa. Although it is often the case that hardness mcreases 
^thtcreT^ modulus, as will be seen m the next section, it is also possible 
to control them somewhat mdependently. 

6.5.2 Urethane Stoichiometry 

Stoichiometry refers to the ratio of reactive f°^^^^^^,f^' 
moieties, to isocyanate groups. A stoichiomet^ of >"^'^^ ^ 
balance between isocyanate and diol groups. However as "^^t 
it is customary with polyurethanes to use an excess of isocyanate and values 
of 85 to 9^axe more Vpical- The reason for this is to taie advantage of 
L siS regions that can take place when excess ^-y^at^ pr^^ 
These are used to further control pad properties. A ratio of exactly iuu/o 
ZIl a lineax high molecular polymer [16). Since^^^^ups 
also react with moisture, exact stoiduometnc balmice ^i*^^* '° 
and consequently this mates molecular weight control difficult, especially m 

' 't^Z uZrX t effect of stoichiometry for - «P^^«^ 
oorous Tvpe 4 pad. As the stoichiometric ratio increases, the amount of excess 
Tocyanate and the nmnber of side reactions decrease. Since many of these 
rdeT^orresult in chemical bonds (known as cross-Unte) between poly- 
mt SS^The polyurethane transitions from a predominanUy thermosetting 
S^Holymrr to a more Unear polymer, having more thermoplastic be- 
havior, as stoichiometry increases. 

Table 6.5. Effect of Stoichiometry on Pad properties 





Stoichiometry (%) 


Property 


75 


85 


95 


Hardness (Shore D) 


60.5 


61.2 


63.4 


Tensile Strength (MPa) 


65.0 


67.1 


80.5 


Elongation to Break (%) 


281 


298 


459 


Modtilus (MPa) 


554 


518 


514 
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As stoichiometry increases, hardness, tensHe strength and elongation in- 
crease, consistent with a decrease in cross-links and an increase in ductility. 
Modulus decreases since cross-links increase rigidity. A comparison of the 
trend of hardness and modulus with stoichiometry shows that this is an ex- 
ample where it is possible to control hardness and modulus independently. 

6.5.3 Pad Thermal History 

The properties of polyurethanes are strongly influenced by their thermal his- 
tory. In this context, thermal history includes the temperatures and times at 
which the pads are initially cured and, where appropriate, subsequently post- 
cured. Optimum times and temperatures will depend on the specific urethane 
system. However, for thermosetting polymers, which includes polyurethanes, 
it is generally accepted [16] that increasing baking temperatures and time at 
temperature will increase the degree of cure and change physical properties. 

Table 6.6 illustrates the effect of thermal history on polyurethane prop- 
erties for an unfnied Type 4 pad. Baking conditions have been designated 
as low, medium and high. High signifies a longer bake at higher tempera- 
ture than medium, and medium versus low have the same relationship to 
one another. As baking conditions increase, pad properties such as hardness 
and modulus increase and elongation decreases. These trends are consistent 
with the formation of more cross-links between polyurethane chains produc- 
ing a network structure and, relatedly, to an increase in the glass transition 
temperature (Tg) of the polyurethane. _ 

Tg is the temperature at which the polyurethane softens appreciably. One 
method to measure Tg from the peak value of the tan delta damping curve as 
measured by dynamic mechanical analysis [17]. Tan delta is a measure of the 
damping ability of a material and is discussed in more detail later in this sec- 
tion. Figure 6.13 shows the effect of baking conditions on the tan delta curves 
for this urethane system. As bakmg conditions increase, the peak in tan delta 



Table 6.6. Effect of Thermal History on Polyurethane Properties 



Baking Conditions 



Property 


Low 


Medium 


High 


Density (g/cm^) 


1.183 


1.184 


1.184 


Hardness (Shore D) 


70.5 


71.2 


73.9 


Yield Strength (MPa) 


29.4 


31.6 


36.1 


TensUe Strength (MPa) 


63.7 


62.1 


68.0 


Elongation to Break (%) 


340 


320 


250 


Glass Itansition Temperature (°G) 


65 


68 


77 


Modulus (MPa) 


757 


806 


1050 



184 
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Tan Delta v. Baking Conditions 




shifts to higher temperatures and sUghtly broadens. At more aggressive bak- 
ing conditions, the tan delta curves will broaden further, signifying the onset 
of thermal degradation and a deterioration in pad properties. When selecting 
baMng conditions, it is important to select times and temperatures which 
optimize properties for the specific apphcation without causing degradation. 

It is appropriate at this point to make a brief mention of the technique 
of dynamic mechanical analysis to study polishing pads. Since pohshing is 
a dynamic process involving cycUc motion of both the pohshing pad and the 
wafer and since polymeric poMshmg paxis are viscoelastic materials, a valuable 
method of studying pad properties is by dynamic mechanical analysis [17]. In 
this technique, a cychc deformation is apphed to the sample at temperatures 
and frequencies that correspond to typical polishing conditions. 

Viscoelastic materials exhibit both viscous and elastic behavior in re- 
sponse to an applied deformation. The resulting stress signal can be separated 
mto two components: an elastic stress which is in phase with the strain, and 
a viscous stress which is in phase with the strain rate but 90 degrees out of 
phase with the strain. The elastic stress is a measure of the degree to which 
a material behaves as an elastic sohd; the viscous stress, the degree to which 
the material behaves as an ideal fluid. The elastic and viscous stresses axe re- 
lated to material properties through the ratio of stress to strain, the modulus. 
Thus, the ratio of elastic stress to strain is the storage (or elastic) modulus 
and the ratio of the viscous stress to strain is the loss (or viscous) modulus. 
When testing is done in tension or compression, E' and E" designate the 
storage and loss modulus, respectively. 

The ratio of the loss modulus to the storage modulus is the tangent of 
the phase angle shift (<5) between the stress and the strain. Thus, 

E"/E' = tan 6 
and is a measure of the damping abihty of the material. 
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With specific reference to polishing, energy is transmitted to the pad 
during the poUshing cycle. A portion of this energy is dissipated inside the 
pad as heat and the remaining portion of this energy is stored in the pad and 
is subsequently released as elastic energy during the poUshing cycle. 



6.5.4 Pad Porosity 

Although the morphologies of Type 1, 2 and 3 pads differ, they all contain 
porosity which plays a key role in determining pad physical properties and 
polishing functionality. For these pads, porosity is needed to retain slurry 
on the pad surface and to distribute slurry uniformly across that surface. 
In this section, the importance of porosity wiU be iUustrated using data for 
ICIOOQTM IC2000™. tm . i . 

Table 6.7 shows typical physical properties of ICIOOO^^ , Style 5. 

Many of these properties are strongly dependent on the porosity present 
in the pad. For these pads, porosity is difficult to measure directly with 
a high degree of accuracy. Instead, porosity is most conveniently determined 
quantitatively by measuring pad density, since the two are related by the 
equation: 

Density of Bulk Polymer - Density of Pad 
Volume Fraction Porosity = Density of Bulk Polymer ' 

Table 6.8 shows the statistical correlations between pad density and other 
physical properties. It is apparent that the correlations are very high (±1.00 
being a perfect correlation). Since many properties depend on pad porosity 
and are not independent of one another, as wiU be seen in a later section, 

Table 6.7. Typical Physical Properties of ICIOOO™ 



Property 


Value 


Porosity (# of cells) 


880 ± 120 


Density (g/cm^) 


0.748 ± 0.051 


Hardness (Shore D) 


52.2 ±2.5 


Shear Strength (MPa) 


51.2 ± 4.1 


Proportional Limit (MPa) 


9.1 ±1.3 


Tensile Strength (MPa) 


21.6 ±2.8 


Elongation to Break (%) 


175 ± 20 


Storage Modulus (MPa) 


310 ± 40 


Loss Modulus (MPa) 


28.0 ±4.5 


Tan Delta 


0.090 ± 0.005 
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Table 6.8. Relationship between Pad Density and other Physical Properties 
Pad Physical Property Correlation Coefficient 



Porosity 


-0.77 


Haxdness 


0.94 


Sheax Strength 


0.84 


Proportional Limit 


0.88 


Tensile Strength 


0.96 


Elongation to Break 


0.80 


Storage Modulus 


0.89 


Pad Stiflhess 


0.88 



relating specific pad properties to poHshing performance is made even more 

"""Tigmes 6.14 and 6.15 graphically show the Unear relationships of density 
with haidness and storage modulus respectively. 

It has been shown above that porosity has a strong influence on other pad 
properties. A consequence of this relationship is that porous pads have more 
variabihty in physical properties than equivalent non-porous P^^; This may 
be illustrated by compaxing the physical property distributions of IplOOO 
and IC2000^^. IC2000™ is very similar to IGIOOO^^ and made by the 
same manufacturing process but is non-porous. Typical physical properties 
of 102000"^^ are shown in Table 6.9. 

Figure 6.16 compares the key physical properties for the two pad types 
in terms of the ratio of standard deviation to property average [18, 19]. It is 
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Fig. 6-15. Storage Modulus versus Density for ICIOOO'^^ 
Table 6.9. Typical Physical Properties of IC2000^^ 



Property 


Value 


Porosity (# of cells) 


0 


Density (g/cm^) 


1.180 ± 0.002 


Haxdness (Shore D) 


73.0 ± 1.0 


Yield Strength (MPa) 


33.4 ±1.4 


Tensile Strength (MPa) 


75.0 ±2.5 


Elongation to Break (%) 


335 ± 20 


Storage Modulus (MPa) 


850 ±61 


Loss Modulus (MPa) 


87.0 ±4.0 


Tan Delta 


0.103 ± 0.005 




clearly evident that for non-porous pads the variability in density is signifi- 
cantly less and that a similar trend is present m the other properties shown. 

An advantage of non-porous pads is thus a much tighter distribution 
of physical properties. Although the relationships between pohshing perfor- 
mance and pad physical properties are not well understood, a tighter prop- 
erty distribution should translate into more consistent polishing performance. 
This may be illustrated for oxide pohshing using the two pad types [18]. Fig- 
ure 6.17 shows the distribution of removal rate for a large number of wafers 
poHshed using either ICIOOO™ or 102000"^^. Conditioning was kept con- 
stant for the two pad types. The distribution of removal rate is much tighter 
for the non-porous pad, consistent with the above discussion. 

In addition to the benefit of more consistent pad physical properties and 
less variabihty in pohshing removal rate, there are other benefits for non- 
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Physical Property Variability of iCIOOO and IC2000 
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Fig. 6.1T. Effect of Pad Porosity on Polishing Removal Rate Distribution 

porous pads. Typically, as may be seen by comparing the property data 
for ICIOOO'^*^ and IC2000^^ shown in Tables 6.7 and 6.9 respectively, non- 
porous pads have a higher modulus. This translates to improved planariza- 
tion of die-scale features during poUshing. Abrasion resistance of non-porous 
pads also tends to be higher which results in less pad wear during pohshing 
and hence longer pad-Ufe. Another benefit of non-porous pads, which has 
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been demonstrated for copper CMP poUsMng l^'^^^^^f;''^^^^^^ 
PnrP. or, the Dad surface can trap polishing debris which can either scratch 
rrilt^^ if the features axe soft materials like copper, or - 

leave residual particles on the wafer surface. challenee is 

There are also challenges with non-porous pads. The prunaxy cii-U^^g^ is 

that nr-porous pads require more rigorous pad break-m to achieve optmiuxn 

that non P°^°^P !? _._ni_jjaio^ [15] that for effective pohshmg, the 

JJo-texture. The UUer is discussed 

L de^tton^t seotioa. Mteo-texture refers to the roushiress o£ the pad 
t rr^^d presence of asperities which contact tne water aurmff poush- 

suiface and the presence o roughened using xypiceliy 

Xr^^reS: aStXlg is stoppled during polishing 

7"d — r^^ldtCUer edge^This reduces 
L wafer surface. Edge effects have been dacussed by B^er [22] and re 
ated to the ph^ical properties of the top and base layers of a pohshmg pad 
nllX modetog work, and subsequent experimental vahdatron. shows that 
Jte stXti of tto top pad increases, edge effects also increase^Smce non- 
poroi e stiffer"an corresponding porous pads, edge eSects wo^ 

poroub nroblem with the non-porous pads and tms 

K 1 11 Itoinated by the use of retaining rings on the wafer earner that 
wi^r^oS^ wr^h the wafer surface. This approach has been nn- 
are 'i^^, .^results by several pohshing tool manufacturers 

^^tC^oa^ to — e edg^ effects is through the 1— us^f 
t^'roovL in the top pad which reduce pad stffljess m a controEed way 
without compromising die-scale feature planaxity [62]. 

6.6 Control of Pad Properties Through Pad Geometry 

The nievious section discussed control of pad properties through the pad 
l^tha^rrm tly and formulation. This section dismisses ti^er^^^^ 
rnarprtperties by physical control of the pad geometry. This mcludes 
^Mn o pad'tMcLess, groove designs, pad shapes, a.d base pad. 
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6.6.1 Pad Thickness 

Top pad thickness is important, since it determines the stiffness of the pad, 
given that pad stiffness is proportional to the product of pad modulus and 
cube of the thickness [23]. Thus doubhng the pad thickness increases stiffness 
eight-fold. Polishing pads used for CMP are typically about 1.3 mm (50 mil) 
thick- Pad stiffness controls several important pohshing parameters, including 
uniformity of removal rate across the wafer, die level planarity, and to a lesser 
extent dishing and erosion of featiures within a die. In order to planarize 
next generation devices, pad thicknesses greater than 1.3 mm (50 mil) are 
preferred, and 2.0 mm (80 mil) thick pads are now being used routinely by 
several major semiconductor manufacturers. Above about 5 mm (200 mil), 
polishing uniformity may suffer because of the inability of the pad to conform 
to variations in global wafer flatness. 

As a polishing pad wears, the overall pad thickness and corresponding 
stiffness decrease. This again argues for a high initial pad thickness, as the 
change in stiffness with polishing time will be relatively less for a thicker pad. 
Additionally, since stiffness is less dependent on the grooved thickness, which 
is removed during pad use, rather than the underlying ungrooved region, 
high thickness for the underlying ungrooved layer and for the overall pad are 
preferred. 

For a given pad thickness, increasing pad modulus wUl increase pad stiff- 
ness and the ability of the pad to planarize. Thus, as mentioned previously, 
unfilled pads will planarize more effectively than filled pads. However, it is 
important to recognize that stiffiaess is proportional to the cube of thickness 
compared to only the single power of modulus, so that changing pad thickness 
can have a more significant impact than changing pad modulus. 

Figure 6.18 [64] shows the effect of pad thickness on oxide planarity for 
a Type 4 experimental pad (OXPSOOO'^^ from Rodel). The polisher was 
a Strasbaugh 6DS-SP and the slurry was ILDISOO^^ also from Rodel. The 
figure shows planarity expressed as "planarity quotient" for different feature 
sizes. Planarity quotient is defined as the oxide removal rate at the bottom 
of a feature trench divided by the oxide removal at the top of a feature. 
Thus, the smaller the planarity quotient, the more efficient the planarization 
process. Since smaller features are more readily planarized, planarity quo- 
tient increases with increasing feature size. For a given value of planarization 
quotient, the feature size corresponding to that value may be defined as the 
"planarization distance". Longer planarization distances indicate enhanced 
ability of a pad to planarize. 

Figure 6.18 shows that for a given value of planarization quotient, pol- 
ishing with a 2.0 mm (80 mil) thick pad results in a higher planarization 
distance versus the 1.3 mm (50 mil) thick pad. For example, at an arbitrary 
planarization quotient of 0.3, the 2.0 nnn (80 mil) thick pad has a planariza- 
tion distance of about 40% longer than the distance for the 1.3 mm (50 mil) 
thick pad (3.9 versus 2.8 nun). 
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Fig. 6.18. Effect of Pad Thickness on Planarity 

As discussed in the next section, grooving the surface of poUshing pads 
also affects pad stiffness. In general, the deeper the grooves with respec^ t^^ 
pad thickness and the closer the grooves are to one another, the more flexible 
the pad. 



6.6.2 Groove Designs 



PolisMng pads used for chemical mechanical polishing typically have macro- 
textme 1 ms car. be either perforations through the pad tnickness or surtace 
PToove oesigns. Sucix surface desi^s include, but are not limited to, circular 
grooves wMch may be concentric or spiral grooves, cross-hatched patterns 
arranged bs aa X-Y grid across the pad surfax^e, other regular designs such 
hexagons, triangles and tire-tread type patterns, or irregular designs such a^ 
fractal patterns, or combinations thereof. The groove ^^o^^ ^-{^^ ^^^l^^^ 
gular with straight sid^walls or the groove cross-section may be V -sha.ped, 
"U"-shaped, triangular, saw-tooth, etc. Further, the geometric center of cir- 
cular desi^s may coincide with the geometric center of the pad or may ^e 
offset. Also the groove design may change across the pad surface. The choice 
of design depends on the material being polished and the type of pohsher, 
since different pohshers use different size and shape pads (i.e. circular versus 

^'""^Table 6 10 provides an overview of the multitude of groove designs that 
have appeared in the US patent hterature over the last ten years. 
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Table 6.10. US Patents covering Groove Designs 



us Patent Ntunber 


Description of Groove Design 


5,020,283 


Achieve constant surface area across pad using circular 


voids, holes, squares and ray designs 


5,177,908 


Sun-burst pattern of non-tapered rays 


5,216,843 


Circumferential triangular macro-grooves and 


conditioning microgrooves 


5,297,364 


Circular voids to control non-uniformity 


5,329,734 


Regions of different circular pore density 


5,394,655 


Annular rings with tapered cross-section 


5,489,233 


Non-porous pads having macro and micro-channels 


5,578,362 


Type 3 pads with circular and X-Y groove designs 


fi fiOQ 719 


X-Y channels with slurry recesses at intersections to 


minimize edge exclusion 


5,628,862 


Molded pad smrface comprising channels between 


hemispherical features . 


5,645,469 


Radially extending tapered channels and circumferential 




grooves 


5,650,039 


Off-center circular spiral groove 


5,690,540 


Inward spiral from pad periphery to center 


5,725,420 


Combination of holes and X-Y grooves where groove pitch 


> hole pitch 


5,778,481 


Spiral, swirl or concentric raised areas direct fluids from 


pad center to edge 


5,842,910 


Circumferential grooves offset from pad geometric center 


5,888,121 


Regions with different circular and X-Y groove dimensions 


5,900,164 


Type 3 pads with circular and X-Y groove designs 


5,921,855 


Circular, spiral or concentric grooves 


5,984,769 


Regions with different circular groove dimensions 



Grooves are added to polishing pads used for CMP for several reasons: 

1. To prevent hydroplaning of the wafer being polished across the surface 
of the pohshing pad. If the pad is either ungrooved or unperforated, 
a continuous layer of polishing fluid can exist between the wafer and pad, 
preventing uniform intimate contact and significantly reducing removal 
rate. 

2. To ensure that slurry is uniformly distributed across the pad surface [24] 
and that sufficient slurry reaches the interior of the wafer. This is espe- 
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daily important when pohshing reactive metals such as copper, m winch 
the chemical component of polishing is as critical as the mechanical. Urn- 
form slurry distribution across the wafer is required to achieve the same 
polishing rate at the center and edge of the wafer. However, the thickness 
of the slurry layer should not be so great as to prevent direct pad-water 

3 To^'lontrol both the overaU and localized stiffness of the pohshing pad. 
' This controls pohshing uniformity across the wafer surface and also the 
ability of the pad to level features of different heights to give a highly 
planar surface. Different regions of the pads may have different groove 
designs. 

4. As a subset of 3, to reduce edge effects. , , . ^ , 

5 To act as channels for the removal of polishing debris from the pad sur- 
' face A build-up of debris increases the MkeHhood of scratches and other 

defects. This is related to 2, since as new slurry replaces the old, the old 

removes the entrained debris. 

It is known [15, 25] that one factor determining pad-life of grooved pads is 
the depth of the grooves, since acceptable polishing performan^ is possible 
only until the pad has worn to the point where grooves have insufficient depth 
to distribute slurry, remove waste, and prevent hydroplaning For Type 3 pads 
containing porosity, the minimum groove depth to prevent hydroplaning is 
awT4l and ior Type 4 non-porous unfilled pads about 5 mil n.order 
to achieve the combination of acceptable pad stiffiaess and long pad-hie, it is 
necessary to have deep grooves but also sufficient remaining pad to provide 
stiffness As ^oove density and groove width increase, pad stiffness becomes 
more dependent on the thickness of the remaining ungrooved layer of the 
pad, rather than on groove depth alone. 

6.6.3 Pad Shapes 

The patent hterature contains many examples of novel pad shapes which 
have been proposed to improve pohshing performance^ Although many of 
these concepts are untested, they illustrate some of the options that may 
be used to optimize pohshing performance through control of pad geometry 
Table 6 11 summarizes some of the key patents covering pad shape. Ihis list 
is by no means complete but iUustrates some of the approaches that are being 
considered. 



6.6.4 Base Pads 



IdeaUy, for uniform pohshing, removal rate should be the same at all points 
on the wafer surface. This would suggest that the pad needs to be m contact 
with the whole wafer surface with the same contact pressure and relative 
velocity between pad and wafer at aU pomts. Unfortunately, wafers are not 
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Table 6.11. US Patents related to Pad Shape 



US Patent # Assignee 



General 
Area 



Brief Description 



5,310,455 
5,516,400 
5,624,304 



LSI 



Extended A lower pad is mounted to the platen and is 
Pad Edge trimmed to the size of the platen. An upper pad is 
mounted to the lower pad, and is sized so that an 
extreme outer edge portion of the upper pad 
extends beyond the trimmed outer edge of the 
lower pad. The outer edge portion of the upper pad 
is deformed downwardly, towards the lower pad. In 
this manner, polishing slurry is diverted from the 
pad-to-pad interface. Additionally, an integral 
annular lip can be formed on the front face of the 
upper pad, creating a reservoir for slurry to be 
retained on the face of the upper pad for enhancing 
residence time of the polishing slurry prior to the 
slurry washing over the face of the upper pad. 



5,234,867 
5,421,769 



Micron 



Non- 
circular 
Pads 



A polishing head mechanism moves the polishing 
head and semiconductor wafer across and past 
a peripheral edge of a non-circular pad to 
effectuate a uniform polish of the semiconductor 
wafer surface. 



5,558,563 IBM Raised A polishing pad which includes raised portions is 

Areas used to apply varying amounts of pressure. In 

addition, the position, size and height of the raised 
portions are used to affect the amount of pressure 
applied. There are several possible methods to 
produce raised areas in the polishing pad. In 
particular, shims can be added to the polishing 
table or the polishing table can be machined so 
that the polis hing table includes raised portions. 

5,785,584 IBM Raised The rotating polishing pad is caused to flex 

5 934 977 Areas upward as it passes over a discrete, non-continuous 

"bump formed in the surface of an underlying 
stationary platen. This deflection in any given 
portion of the pad is a transient condition as the 
raised portion of the pad will fall back from to its 
original configuration once it clears the underlying 
bump. Thus, portions of the pad will be 
continuously progressed up, over and back down to 
create a discrete raised portion in the polishing 
pad surface at a fixed location. 



5,888,126 Ebara Raised The abrasive cloth has a projecting region on 

Areas a surface thereof for more intensive contact with 
the workpiece than other surface regions of the 
abrasive cloth. Projecting regions on the pad 
surface are effected by creating projecting regions 
in the underlying upper surface of the turntable 
using actuators which may be electromagnetic, 
piezoelectric or compressed air. 



5,769,699 Motorola Multiple 

Regions 



The polishing pad has a first region that is closer 
to the edge of the polishing pad and a second 
region adjacent to the first region and further from 
the edge of the polishing pad. The polishing pad is 
configured, so that the second region is thicker or 
less compressible compared to the first region. 



5,738,567 Micron Multiple The polishing pad has a polishing body and 
5 910 043 Regions a cleaning element positioned in the polishing 

' ' body. In operation, the cleaning surface 

periodically engages the wafer when the wafer is 
engaged with the pad to remove residual materials 
from the surface of the wafer. 
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Table 6.12. Patents describing Multiple Layer Pads 



Patent # 


Assignee 


Brief Description 


US5,212,910 


Intel 


Three layer composite pad, comprising a top 
polishing layer, an intermediate stiff layer 
which may be segmented to create a 
"bedspring" effect, and a bottom elastic 
cushioning layer. 


US,5257,478 


Rodel/ 
Westech 


Pad has at least two layers, wherein the base 
layer is more compressible than the top 
planarizing layer. 


US5,287,663 


National 
Semiconductoi 


Three layer composite pad, comprising a top 
polishing layer, an intermediate rigid layer and 
a bottom resiMent layer. 


US5,564,965 


SEH 


Three layer composite pad, comprising a top 
polishing layer, an intermediate rigid layer and 
a closed-cell foam of soft rubber as the bottom 
layer. 


US5,664,989 


Toshiba 


Two layer polishing pad, wherein the base 
layer consists of fine bags hermetically sealed 
with fluid to provide uniform pressure 
distribution across tne waier. 


US5,871,392 


Micron 


Base pad contains a plurahty of thermal 
conductors to conaucu jp.eaii irom tne poubiuiig 
layer to the platen. 


US5,876,269 


NEC 


Two layer poUshing pad comprising an upper 
polisiiing layer naraer tnan tne lower Lcxy^L 


US5,893,755 


KomatsTi 


Discloses siUcon rubber base pad to reduce 
waviness during poUshing. 


US5,899,745 


Motorola 


Base pad has an edge and central portion, such 
that the compressibility of the center portion 
is higher than that of the edge. 


US5,899,799 


Micron 


Grooved base pad which creates depressions in 
the upper pad for control of slurry flow across 
the poUshing pad. 


EPA845,328 


Sumitomo 


Three layer composite pad, comprising a top 
porous poUshing layer, an intermediate 
support layer and a bottom resilient layer. 


EPA919,336 


Speedfam 


Two layer pad comprising a top pad over 

a more compressible base pad, such that both 

axe uniformly permeable to polishing fluid. 
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perfectly flat and typically have some degree of curvature resulting from the 
stresses of manufacture and differing coefficients of thermal expansion of the 
various deposited oxide and metal layers. This requires the polishing pad to 
have sufficient flexibility to conform to wafer-scale flatness variability. One 
solution to this problem is to laminate a stiff pohshing pad to a flexible under- 
lying base pad, which is typically a more compressive, foam-type polymeric 
material This improves polishing uniformity across the wafer without unduly 
compromising the stiffness of the polishing top pad. 

Several patents have recently been issued disclosing the use of pohshing 
pads, wherein the top pad, which contacts the wafer, is laminated to at least 
one underlying pad. This base pad is typically more compressible than the 
top pad. In some cases, a thin but stiffer intermediate layer is sandwiched 
between top and base pads. Table 6.12 summarizes some of the main patents 
covering multiple layer pads. 

Table 6.13 shows physical properties of two commonly used base pads. The 
microstructures of these base pads have been shown previously in Figs. 6.9 
and 6.10. Suba IV'^^ is a polyurethane impregnated felt and the other base 
pad is a closed cell polyurethane foam. 

As discussed previously, Suba IV'^^ and foam base pads have very dif- 
ferent microstructures. The effect of microstructure on physical properties is 
that the properties of Suba IV'^^ are anisotropic, whereas those of the foam 
are isotropic. The properties shown above are measured through the thick- 
ness of the pad rather than m the plane of the pad. The porosity in Suba 
jyTM essentially open which can lead to wicking of slurry into the base 
pad. 

Figure 6.19 [64] shows the effect of base-pad on the planarity of oxide 
features, polished with a Type 4 experimental pad (OXPSOOO'^^ from Rodel). 
The pohsher was a Strasbaugh 6DS-SP and the slurry was ILDISOO'^^ also 
from Rodel. Three sets of data are shown, corresponding to no base pad, 
Suba IV'^^, and an experimental foam base pad similar in microstructure 
and properties to the foam base pad discussed above. 

It is clearly evident that the best planarity is achieved with no base pad. 
However, pohshing uniformity across the wafer is poor caused by imperfect 



Table 6.13. Typical Physical Properties of Base Pads 



Physical Property 


Suba 


jyTM 


Foam Base Pad 


Thickness (mm) 


1.3- 


-1.4 


1.3- 


-1.5 


Density (g/cm^) 


0.27- 


-0.32 


0.43- 


-0.53 


Hardness (Shore 0) 


65- 


-75 


45- 


-55 


Compressibility (%) 


6- 


■10 


2- 


-6 


Rebotind (%) 


90- 


-95 


>90 
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Fig. 6.19. Effect of Base Pad on Planarity 



wafer flatness and thickness, as well as physical hmitations of the top pad 
and pohshing tool. Using a base pad compromises planarity but improves 
uniformity across the wafer by reducing the impact of the other problems. The 
experimental foam base pad has lower compressibility than Suba IV™ which 
increases the abihty to planarize. Thus at a given planarization quotient, the 
planarization distance of the foam base pad is longer than that of Suba IV'^^. 

The base pad also has an impact on pohshing non-uniformity at the edge 
of the wafer. As mentioned previously, pohshing non-uniformity at the wafer 
edge depends on both the stiffness of the top pad and also on the compressibil- 
ity of the base pad. The so-called "edge effects" result in less material being 
removed near the wafer edge. The problem becomes worse as the stiflhess of 
the top pad increases and the compressibility of the base pad increases [22]. 
In order to achieve good planarization behavior with minimal edge effects, 
the preferred pad structure will comprise a stiff top pad over a base pad with 
just sufficient compressibihty to ehminate polishing tool imperfections. 



6.7 Relationships Between Pad Properties 
and Polishing Performance 

The relationships between pad properties and pohshing performance are com- 
plex and not fuUy understood. One reason for the complexity is that pol- 
ishing performance is not categorized by a single parameter but by several 
parameters which are dependent on the scale of the features being polished. 
Table 6.14 categorizes polishing into three levels - wafer-, die- and feature- 
scale. 

Wafer-scale refers to polishing across the whole wafer surface. Currently 
wafers are typically 200 nun in diameter but the industry is beginning to 
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Table 6.14. Characterization of Polishing Performance Parameters 



Polishing Performance Scale 

Wafer-Scale Die-Scale Feature-Scale 

Removal Rate (RR) Planarization (P) Conductor Dishing (CD) 

Non-Uniformity (NU) Defectivity (D) Oxide Loss (O) 

Edge Effects (EE) Selectivity 

Macro-scratches (MS) Defectivity (D) 

Pad Life (L) Roughness (Rg) 



gradually transition to 300 nam. Die-scale polishing applies to polishing across 

the area of a die, many of which populate the surface of a wafer. The third 
level, designated feature-scale, refers to the polishing of conductor lines, bond 
pads, posts and other features within the die. Such features have dimensions 
measured in sub-microns and microns. For each scale level, Table 6.14 shows 
the polishing parameters which are most important for that level. In most 
cases, the meaning of each of these parameters is self-evident and further 
definition will not be provided here. The reader is referred to other sections 
of this book for more details. 

A second factor complicating an understanding of the relationships be- 
tween pad properties and polishing performance is the strong dependence of 
one pad physical property on another. For example, it has been discussed 
previously that pad density strongly correlates with other properties such as 
hardness and modulus. Thus, it is very difiicult to isolate one property and 
look at its effect on pohshing performance without simultaneously changing 
other pad properties. 

Thirdly, polishing performance depends on many factors other than just 
the pad. These include polishing tool set-up (e.g. platen and carrier speeds, 
down-force, back-pressure, carrier head design and carrier film, etc.), slurry- 
related variables (e.g. abrasive type and loading, pH, flow-rate, etc.), and pad 
conditioning (e.g. diamond density, exposure and placement, sweep profile, 
and down- force, etc.). How the pad conditions and the resultant topography 
of the pad conditioned surface also depends on the properties of the pad itself, 
as well as the design of the conditioner. In determining relationships between 
pad properties and performance, the other polishing variables must be kept 
constant, since many of these have a major impact on polishing results, which 
in some cases may be greater than the contribution from the pad itself. 

Given the above caveats, this section attempts to relate the polishing per- 
formance characteristics described in Table 6.14 with specific pad properties. 
Only the major relationships will be discussed, as summarized in Table 6.15. 
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Table 6.15. Relationship between Pad Properties and Polishing Performance 





Polishing Scale 




Pad Property 


Wafer 


Die 


Feature 


Conditionabuity 


Density 
(Porosity) 


RR, NU 


D 


CD, 0 


Yes 


Hardness 


MS 


D 


D, Rg, CD, 0 


Yes 


Tensile 


L 






Yes 


Properties 










Abrasion 


L 






Yes 


Resistance 










Modulus 
(Stiffness) 


EE, NU 


P 




Yes 


Thickness 


L 








Top Pad 
Compressibility 




P 


CD 




Base Pad 


EE, NU 


P 






Compressibility 










Pad Texture 


L, RR, NU, EE 








(Grooves) 










Pad Roughness 


RR, NU 


P 


CD, 0 


Yes 


Hydrophilicity 


RR 






Yes 



6.7.1 Removal Rate 

The rate of removal of the material being polished depends on many fac- 
tors iacluding both the macro and microtexture of the pad, and also the pad 
physical properties. Macrotexture is achieved by either punching perforations 
through the pad or by forming g?:ooves into the pad surface. The latter has 
been extensively discussed in aa earlier section of this chapter. A key role of 
macrotexture is to prevent hydroplaning of the wafer across the pad surface, 
axising from an excessively thick layer of polishing fluid under the leadmg 
edge of the wafer which limits intimate contact between pad and water and 
simificantly reduces removal rate. Levert et al. [26] have studied the thickness 
of the slurry fihn for different pad types and shown that hydroplamng is po- 
tentially more of a problem with impermeable (Type 4) and semi-perm^ble 
(Type 3) pads, rather than with permeable pads (Types 1 and 2). Thus 
macrotexture becomes more critical for Type 3 and is an absolute necessity 
for Type 4 pads. Indeed, it has been shown that lack of macrotexture with 
Type 4 pads results in no removal [15]. It ha^ also been reported [27] for 
Type 3 pads that grooves give higher removal rates and more stable removal 
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than perforations. The reason was attributed to grooves in the poUshing pad 
surface facilitating slurry flow to the wafer surface during poUshing. Grooves 
also enable slurry to be squeezed back out from the leading edge of the wafer, 
thus eliminating the potential for hydroplaning. 

Removal rate also depends on pad microtexture which may be loosely 
defined as the localized roughness of the pad surface. Microtexture comes 
from both pad conditioning and from porosity within the pad. Thus other 
factors being held constant, Type 4 pads which have no inherent porosity 
give lower removal rates than Type 3 pads [18, 28]. 

Pad conditioning has a strong effect on removal rate and removal rate sta- 
bihty and has been studied extensively [13, 28, 29, 30, 31, 32, 33]. As discussed 
in the first chapter, it is known that either in-situ or ex-situ conditioning of 
the pad surface is required to achieve and maintain stable removal rates. The 
role of conditioning is to create asperities on the pad surface which contact 
the wafer. Without conditioning, the asperity population decays during pol- 
ishing and removal rates rapidly decline. The rate of decline is usually higher 
for Type 4 than Type 3 pads, since in Type 4 pads there are no inherent 
asperities from a pore structure, such as that found in Type 3 pads. 

Cook [13] and Hetherington [32] have independently used scanning elec- 
tron microscopy to study changes in the surface of IC2000'^^ (Type 4) and 
101400'^^ (Type 3) pads respectively during pohshing in the absence of con- 
ditioning. For both pad types, the photomicrographs clearly showed a pro- 
gressive decrease in pad asperities and the increasing formation of a smooth 
mesartype structure. Hetherington postulated that the asperities were de- 
forming under the shearing conditions associated with pohshing, whereas 
Cook favored a progressive abrasive smoothing of the asperities. It is proba- 
ble that given the conditions of pohshing both shear deformation and abrasive 
loss are occurring. 

Bajaj et al. [28] and Oliver et al. [33] have also studied the relationship 
between pad properties and removal rate decline in the absence of condition- 
ing. During polishing, pad surface roughness decreases because the shearing 
forces of polishing reduce the height and number of pad asperities. The rate 
at which the asperity population declines was shown to be inversely propor- 
tional to the shear modulus of the bulk polymer. 

6.7.2 Non-Uniformity 

Removal rate uniformity across the wafer surface (within wafer non-uni- 
formity) also depends on the macro and microtexture of the surface of the 
pohshing pad in contact with the wafer. However, as previously discussed, be- 
cause of edge effects non-uniformity also depends on the base pad and the bal- 
ance of properties between top pad stiffness and base pad compressibility [22] . 

Goetz [34] has studied the effect of the base pad construction on pla- 
narization at intra-die, die, and wafer length scales using contact and plate 
bending mechanics to determine the pressure variations due to loading. Both 
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single-layer and two-layer sub-pad constructions were analyzed. As expected, 
the effect of sub-pad construction was greatest in the multi-miUimeter length 
scales (die to wafer scale) and attenuated by the stiffness of the upper pol- 
ishing layer. Single and two-layer sub-pads responded in different ways to 
loading, such that the response of the single-layer sub-pad was dominated by 
contact mechanics, while the two-layer sub-pad response was dominated by 
plate bending. 

Polishing non-uniformity across the wafer is also strongly dependent on 
the polisher design, especially the carrier head. Base pads were originally 
used to overcome Hmitations with the polishing tool. As pohsher designs 
become more sophisticated, the need for base pads will decrease and polishing 
uniformity across the wafer surface, including the edge exclusion region, will 
improve. This trend will be especially important as the industry transitions 
to larger diameter wafers. 

6.7.3 Pad Life 

Pad hfe is a complex issue since several factors can determine the useful life 
of a pad. Also, different manufacturers have different criteria for determining 
pad life. However, it is usually determined by the point at which some specific 
polishing parameter exceeds the control limit for that parameter. Such pa- 
rameters can be removal rate, non-uniformity or, for shallow trench poUshing, 
selectivity of oxide to nitride. Two events which commonly limit pad hfe are 
glazing of the pad surface and loss of macrotexture. Glazing is the build-up 
of pohshing debris on the pad surface which clogs the pores [28, 33, 35, 36] 
and becomes increasingly difficult to remove by further conditioning. This 
problem is often seen, especially with Type 1 and 3 paxis. The second event 
involving loss of macrotexture results from the abrasive wear of the pad dur- 
ing pohshing, especially from the pad conditioning process, which reduces 
the overall pad thickness and the depth of the grooves in the pad surface. 
Once the grooves have been abraded, hydroplaning will occur for the reasons 
discussed above. Secondly, as the thickness of the top pad decreases during 
polishing, the pad will become less stiff and polishing planarity will decrease. 
Pad hfe may be increased by cutting deeper grooves [15, 25] and by increasing 
pad thickness so that pad stiffness is not compromised. 

6.7.4 Planarity 

At the die-level, achieving planarity of the features is an important concern 
and a main driving force for CMP polishing as a device fabrication step. It 
has been recognized for sometime [37, 38, 39, 40, 41] that stiffer and harder 
pads give improved planarity. As has been discussed previously, for Type. 
3 pads stiffness and hardness are often strongly correlated to one another 
and it is not easy to determine which has the greater unpact on planarity. 
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However, it is becoming generally accepted that stiflEness of the polishing pad 
is the more important property determining planarity. As discussed earlier, 
pad stiffness depends on both the pad modulus and thickness. 

Experimentally, the relationship between planarity and pad stiffness has 
been independently verified by several researchers [42, 43, 44]. For example 
in an oxide CMP process, the planarization performance of a Type 3 pad 
(ICIOOO'^^) was compared to that of a Type 4 pad (102000'^^). With other 
poHshing parameters kept constant, the pohshing data clearly demonstrated 
that the stiffer IC2000™ pad produced improved planarity. Other work [41, 
44], and data presented earlier in this chapter, have shown that increasing 
the pad thickness produces a similar response and that less compressible or 
thinner base pads also improve planarity. 

Simplistically, the improved planarity achieved by using stiffer pads comes 
from the reduced bowing of the pad as it planarizes. This phenomenon has 
been modeled by several groups [41, 44, 45], basically by treating the polish- 
ing pad as a beam in deflection. Nanz and Camilletti [46] have summarized 
the various modeling approaches and critiqued the strengths and weaknesses 
of each model. Steigerwald et al. [7] have also reviewed models describing 
planarity. 

Grillaert et al. [41] have analyzed oxide thickness variation after CMP 
within a die (WIDNU) for two extreme cases: perfect pad bending and no pad 
bending. The perfect pad bending model consisted of a thin top pad on a soft 
bottom pad. In this case, the top pad bends easily and large oxide variation 
was found. The no pad bending case, consisting of a thick inflexible top pad, 
significantly reduced oxide thickness variation. Experimentation confirmed 
that WIDNU for a real stack pad always falls between these two extreme 
cases. 

There is also growing evidence that pad roughness has an effect on pla- 
narity. Cook et al. [47], in attempting to analyze the physics, mechanics and 
chemistry of the CMP process, have concluded that the best descriptor of that 
process is an asperity contact model. The authors found that pad roughness 
influences the limits to achievable planarity, such that planarity was degraded 
as pad roughness increased. 

Renteln and Coniff [48] have related pad roughness back to pad modulus 
by hypothesizing that roughness created on the surface of the pad can be 
treated like an additional elastic layer. This can be modeled as a two-layer 
composite structure where the overall modulus depends on the volume frac- 
tions (thicknesses) and moduli of the individual layers. Since the surface layer 
will have a more open pore structure than the bulk, it is reasonable to assume 
that its modulus will be lower than that of the underlying layer. Using this 
model, Renteln and Coniff were able to explain the experimental data. In the 
same paper, they also postulated that planarization rate depends solely on 
the elastic modulus characteristics of the pad and that anelastic effects of the 
pad do not play a direct role. 
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Using a different approach to explain the dependence of planarity on pad 
roughness, Yu et al. [49] have developed a statistical asperity model which 
relates the time-dependent deformation of asperities to the planarization of 
device features. The model assumes that the applied load is distributed be- 
tween the asperities and the fluid film which fills the gap between the polish- 
ing pad and the wafer. At low platen speeds the hydrodynamic pressure of 
the fluid film is small and the load is principally carried by the asperities. The 
model also relates asperity size to trench widths. For wide trenches where the 
trench is larger than the asperity, the asperity can contact and remove mate- 
rial from the trench bottom. However, for narrower trenches, device geometry 
restricts asperity contact and poHshing is selective. For wide trenches, geom- 
etry effects become insignificant and pohsh selectivity disappears. In order 
to match better the model predictions with experimental data, the authors 
proposed that, since the pad material is viscoelastic, an asperity does not de- 
form instantaneously as it enters or leaves a trench. In consequence, contact 
to the trench bottom is reduced and polish selectivity is increased. 

Grillaert et al. [50] have developed a model which attempts to explain why 
step height as a fimction of polishing time initially decreases linearly followed 
by an exponentially decreasing reduction as planarity is achieved. Also, their 
model predicts the influence of pattern dependence on step height reduction. 
The model assumes that the applied load is only supported by the features 
in contact with the pad, the pad has limited compressibiHty, the pad bends 
perfectly, Preston's equation [51] can be extended to local removal rates and 
pressures, and that the pad is perfectly 'smooth. Since the pad has limited 
compressibility, during initial polishing the entire load is supported by the 
up features only and removal rate of these features is high. As long as the 
step height is high enough the pad does not touch the down features and no 
material is removed there. As polishing proceeds step height reduces. At some 
point, the step height is small enough so that the pad comes into contact 
with the down featiues. When this occurs the pressure will be distributed 
over both up and down features and the rate of further step height reduction 
will decrease. Since the pad surface is not perfectly smooth but is comprised 
of asperities, the protruding asperities can remove material from the down 
features before the average pad surface is in contact with the down features. 

From the above discussion, the three pad properties determining planarity 
are pad stiffness, pad compressibility and surface roughness from pad asper- 
ities. Also the time dependence of these properties is important and must be 
considered. It is also probable that the relative importance of each depends 
on the material being poHshed. To date, no pubhshed model satisfactorily 
combines these into a single, working, predictive model. 

6.7.5 Macro-scratches and Defectivity 

Although pad hardness is believed to have only an indirect impact on pla^ 
narity, hardness remains an important physical property in terms of macro- 
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and micro-scratches, and other defects involving physical damage of the de- 
vice being polished. In general, to avoid scratches, softer materials such as 
aluminum [52] or copper are polished with softer pads, and harder materials 
like oxide and tungsten use harder pads. However, there axe many exceptions 
to this rule. For example, very soft Politex"^^ pads are used to poMsh tung- 
sten plugs. One reason to use harder pads to polish softer materials is the 
need for good planarity and, as previously discussed, higher hardness and 
stiffness are often linked. 

Defectivity also appears to be related to pad porosity. One explanation 
is that pores in the pad surface become clogged with polishing debris which 
can cause scratching and leave residual particles on the wafer surface. For 
copper CMP, it has been shown that an experimental unfilled Type 4 pad 
gave a significantly lower level of defects after polishing than a standard Type 
3 IClOlO™ pad [21]. 

6.7.6 Conductor Dishing and Oxide Erosion 

Conductor dishing and oxide erosion are feature-level poHshing parameters 
applicable to metal CMP and are especially important for next generation 
devices using copper metallization. As feature sizes decrease and conductors 
become smaller, copper is emerging as the preferred metallization because 
of its superior electrical conductivity over aluminum and tungsten. Copper 
damascene processes are being developed as the preferred method of manu- 
facturing [53]. In these processes, trenches are created in the oxide dielectric 
layer. Copper is then deposited by electroplating, filling these trenches and 
covering the surface of the rest of the chip. CMP polishing is used to polish 
away the excess copper, leaving only the inlaid interconnect lines. 

The objective of the copper pohshing step is to achieve planarity of fea- 
tures with only minimal loss in the cross-sectional area of the conductor lines, 
so that high conductance is maintained. However during poHshing, conductor 
dishing and oxide erosion occur which produce device feature deviations from 
the idealized damascene structures. Conductor dishing is defined as the dif- 
ference in elevation of the insulator region to the metal line and oxide erosion 
is the loss of dielectric within the conductor feature array. Additional oxide 
loss, known as field oxide loss, occurs globally because the oxide polish rate 
is non-zero during over-pohshing. Total conductor thickness loss which must 
be minimized is the sum of field oxide loss, local oxide erosion and conductor 
dishing. Figure 6.20 further illustrates the terms dishing (D), oxide erosion 
(E) and field oxide loss (F). 

Several authors [53, 54, 55, 56, 57, 58] have independently developed mod- 
els to explain the phenomenon of dishing. In general, their models are exten- 
sions of those used for within die planarity as discussed earher, and take into 
account pad properties such as pad stiffness, pad compressibility, hardness 
and surface roughness. 



6 CMP Polishing Pads 205 




Fig. 6.20. Schematic lUustration of Dishing, Oxide Erosion and 



In an early paper on dishing of copper damascene structures, Steigerwald 
et al [531 extended the pad deflection beam model of Sivaram et al. [59] to 
include the effects of both pad compressibility and roughness. Pad compres- 
sion was taken into account by assuming that only the surface lay^r of the 
pad compresses as it pushes against the wafer and, bs the pad ndes over 
a recess, that some of the compression is relieved. The assumption was made 
that if pad deflection is due to pad compression, then only the near-suxtace 
layer is assumed to bend. Pad roughness was incorporated mto the model 
using the concept of Rentehi and Coniff [60], in which the rough surface 
layer is assumed to have a lower elastic modulus than the bulk material In 
agreement with experimental observations, the model predicts that stiffer, 
smoother pads of low compressibility wifl give lower dishing. 

In a more recent paper, Yang [57] has attempted to develop a quantitative 
model of dishing based on a Prestonian analysis of different removal rates 
for high and low copper features. The model incorporates terms for pad 
bending and pad compressibility Calculated results of dishing axe m good 
agreement with experimental data, such that the model accurately predicts 
copper dishing as a function of hne-width and over-polish. The model further 
predicts an increase in copper dishing as feature size increases due to pad 
bowing. The wider the metal feature or the more compliant the pad, the 
more the pad can deform to remove metal within the dish. Thus the model 
predicts that both low pad compressibiUty and low conformity are desirable 

for reducing copper dishing. j;eu;„a- 
Nguyen et al. [58] have developed an alternative model to explam dishing 
based on the assumption that material removal occurs predominantly at the 
pad/wafer contacts. Their statistical contact mechanics model assumes: aj 
material removal occurs at the mechanical contact between pad asperities 
and wafer, b) distribution of pad asperity contact size is Gaussian c) dif- 
ferent removal rates occur for asperities with contact size smaller and larger 
than the hne width, d) Preston's law is valid, e) dishing occurs during over- 
pohshing, and f ) oxide erosion is neglected. Excellent agreement is obtained 
between experimental data and model predictions for both time dependence 
and feature size dependence of dishing. Their model predicts that pad surface 
morphology has the greatest unpact on asperity contact size distribution and 
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hence on dishing. This intuitively makes sense given that the typical rough- 
ness of a pad surface is comparable to the size of the conductor features being 
polished. Also it predicts the importance of pad conditioning in. producing 
an optimized surface topography for poUshing. 

Closely related to dishing is the phenomenon of oxide erosion. This is 
a thinning of the oxide layer resulting from a non-zero oxide polish rate 
during the over-polish step. Like dishing, it causes a reduction in conductor 
cross-section and is equally undesirable. As discussed above, less compressible 
and stiff er pads are preferred to minimize dishing. However, such pads also 
tend to be harder which can lead to increased oxide erosion, since harder pads 
tend to have higher oxide pohsh rates [53]. Thus from a pad perspective, there 
are opposing needs in terms of pad hardness. One solution to the problem 
is to use a fairly hard pad in combination with a slurry which has a much 
higher removal rate for copper than oxide. Such slurries are discussed in detail 
elsewhere in this text. 

Yang [57] has extended his quantitative model of dishing to describe oxide 
erosion. The model accurately predicts oxide erosion as a function of pattern 
density and over-polish time. 

6.7.7 Pad Conditioning 

As mentioned already several times in this chapter, pad conditioning has 
a major impact on all aspects of poUshing performance. Many pad properties 
affect and are influenced by pad conditioning. As an example of the former, it 
is found that pads containing porosity (Type 3) are more readily conditioned 
than Type 4 pads. This is because porosity reduces the abrasion resistance 
of the pad which facilitates the creation of microtexture by the diamond 
conditioner. The mechanism of microtexture creation is also affected by the 
tensile, hardness and modulus properties of the pad. For pads which have high 
modulus and hardness with low ductility, the conditioning disk diamonds cut 
channels by preferentially fracturing and removing pad material. In contrast, 
for softer, lower modulus and more ductile pads, the diamonds cause plastic 
flow and plough the material aside rather than by physically abrading it to 
create channels. So in the former cause, pad conditioning results in higher pad 
wear, whereas in the latter case pad loss during poHshing is significantly less, 
since the surface is essentially being rearranged. This is especially true when 
polishing softer materials such as copper which do not themselves significantly 
wear the pad surface. 

Examples of pad properties which are affected by conditioning are, of 
course, pad roughness but also pad hydrophilicity. The process of roughening 
the pad surface by conditioning in the presence of either water or poHshing 
fluid increases the critical surface tension of the pad and makes it more hy~ 
drophilic. This aids the wetting of the pad surface by the poHshing fluid and 
allows the fluid to spread uniformly across the pad surface and under the 
wafer. 
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6.8 Slurryless Pad Technology 

Although "Slurryless Technology" is covered in a separate chapter in this 
book, for completion given its emerging importance, it is appropriate to in- 
clude a brief discussion of Slurryless Pad Technology m this chapter. 

ConventionaUy, CMP polishing is accomplished by using an abrasive 
particle-containing slurry in combination with a poHshing pad. While such 
slurries are universally employed, it is recognized that their use gives rise to 
significant problems: 

1. Although polishing is practiced in a clean room environment, the par- 
' tides themselves are a serious source of contamination when polishing 

semiconductor devices and can leave residual particles on the polished 

2. The quaUty of the surface produced during poUshing is highly dependent 
upon the particle size distribution and composition of the particles in 
the slurry. Anomalously large particles, even in extremely small concen- 
trations, are commonly responsible for scratches and other post-polish 
mechanical defects. These are highly deleterious to the yield of semi- 
conductor devices processed by polishing. At the soMs content of these 
poUshing slurries (typicaUy >12%) it is practically impossible to use fil- 
tration to remove the oversize particles due to clogging effects on the filter 
medium. Thus expensive and time consuming efforts have been made to 
control and reduce oversize particles in the slurries employed. However, 
there are few practical safeguards against their accidental introduction. 

3. A desirable practice in CMP polishing is to reuse or recirculate the pol- 
ishing slurry to reduce manufacturing cost and the quantity of waste 
products from the operation. However, the activity of pohshing slurries 
is commonly observed to vary with time when recirculated. This may be 
due to the addition of dross, or pohshing byproducts from the substrate 
into the slurry solution, attrition or breakdown of the poHshing particles 
themselves during use, or . chemical changes in the particles which reduce 
activity. This level of variation in recirculated slurries is unacceptably 
high for processing semiconductor devices. Recirculation of this system 
is exceedingly difficult because the byproducts of the pohshing process 
are often practically indistinguishable from the original slurry particles, 
and it is equaUy impossible to control their size or remove them from the 
solution. In consequence, the sohd particle content of the recirculated 
slurry continuously increases with time. As the polishing rate is directly 
proportional to the sohds content of the slurry, practical control of the 
pohshing rate is difficult. A serious additional problem is the acciden- 
tal incorporation of oversize contaminant particles into the recirculating 
slurry, often due to substrate breakage. The aforementioned difficulties 
in filtering slurries make it virtually impossible to remove these contam- 
inants. 
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Because of the above concerns, recirculation of slurry is not widely prac- 
ticed in the pohshing of most semiconductor devices because of the need 
to control activity precisely and the avoidance of damage by contaminants. 
Slurry is usually simply iised once and disposed of as waste. As a result, the 
cost of slurry and slurry waste disposal is the single largest contributor to 
the cost of polishing semiconductor devices. 

Prom the above, it is clear that a polishing process which uses a particle- 
free Uquid which can be readily recirculated and kept in a constant particulate- 
free state, would be preferred in the processing of semiconductor wafers over 
conventional particle-containing slurries. Such particle-free Uquids, commonly 
known as "Reactive Liquids" are beginning to find ax:ceptance for CMP pol- 
ishing [65, 66, 67]. Typically, reactive Mquids are based on similar chemistries 
to those of conventional particle-containing slurries and are used in combina- 
tion with pohshing pads which themselves contain abrasive particles incor- 
porated into the pad matrix. 

Table 6.16 shows recent patents describing reactive liquid chemistries and 
types of slurryless pads. The reader is referred to these patents and the "Slur- 
ryless Technology" chapter of this book for a more detailed discussion of 
polishing using slurryless technology. 

6.9 Future Trends in Polishing Pads 

For the foreseeable future, CMP polishing will remain as an integral man- 
ufacturing operation in the production of semiconductor devices. As CMP 
pohshing evolves and semiconductor devices become more complex with finer 
feature geometries and more metallization layers, the need for more repro- 
ducible, precision pohshing performance will increase. This will necessitate 
improved polishing pads with less "pad to pad" variability which provide 
more predictable and consistent pohshing performance. CMP will become 
more of a standard process and semiconductor manufacturers will contmue 
to expect reduced cost of ownership for pohshing consumables. This will ini- 
tially impact the more established CMP poMshing operations, such as oxide 
and timgsten polishing, but will eventually also impact still emerging pro- 
cesses, such as copper CMP. 

With respect to polishing pads, it is expected that polyurethanes will re- 
main as the preferred polymer type because of their attractive combination 
of properties, their versatile chemistry, the ability to control pad properties 
over wide ranges, and the ability to form pads by a wide range of manufactur- 
ing processes. Pad suppliers will continue to develop alternative, simpler pad 
manufacturing processes with the goals of more consistent pad properties at 
reduced cost. 

Work will continue by pad supphers, semiconductor manufacturers, and 
academics to develop a more comprehensive understanding of the relation- 
ships between pad properties and polishing performance. This work will by 
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Table 6.16. Patents describing Reactive Liquids and Slurryless Pads 



Patent # 


Assignee 


Brief Description of Invention 


US4,393,628, US4,466,218, 
US4,613,345 


IBM 


Slurryless pads for disk 
polishing, comprising 
a polyurethane foam containing 
fixed alumina abrasive 


US5,152,917, US5,304,223, 
US5 435 816, US5,454,844, 
US5,549,961, US5,692,950, 
US5,958,794, US6,007,407 


Minnesota 
Mining and 
Manufacturing 
Company 


Extension of 3M's 
Microreplication technology. 
Photopolymerizable slurryless 
pads comprising 
a three-dimensional 
fixed-abrasive element 


US5,725,417, US5,782,675 


Micron 


Conditioning and refurbishment 
of fixed-abrasive pads 


US5,759,918 


Obsidian 


Linear polishing tool for 
fixed-abrasive pads 


US5,932,486, US6,030,899 


Rodel 


Slurryless polishing using 
reactive liquids free from 
particulate matter in 
combuiation with polishing pads 
having a multiplicity of surface 
nanoasperities. 


US6 022 264, US6,099,394, 
US6,069,080 


Rodel 


Slurryless polishing pads 
comprising a high modulus 
phase and a low modulus phase 


WOQ9 755493 


Ebara 


Grinding wheel for slurryless 
poUshing comprising abrasive 
grains and pores in a polymeric 

binder 


EP 0 874 390 Al 


Hitachi 


Grindstone for slurryless 
polishing comprising abrasive 
grains and a bonding resin 


US6,117,775 


Hitachi 


An abrasive-free reactive liquid 
for polishing metal films 



comprising an oxidizer and 

a substance which renders oxides 

water-soluble 



necessity encompass all aspects of the polishing process, including interac- 
tions of the pad with the poUshing fluid, the role of conditioning and polishing 
parameters, and of poUshing tool design. Prom such work, it is expected that 
polishing consumable suppUers wiU be able to offer manufacturers a total 
polishing solution consisting of, for example, an optimized pad, fluid, condi- 
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tioner combination. Such combinations will enable synergistic performance 
advantages for the customer. 

Industry trends that are currently in their infancy which will impact pad 
suppliers are the shift to 300 mm. wafers, poUshing with abrasive-free reactive 
liquids, the increasing use of copper metaUization in combination with low 
K dielectrics, and the growing importance of linear polishing tools. All four 
are driven by the need to reduce semiconductor device manufacturing costs, 
improve yields and performance, and to increase throughput. These issues are 
separately discussed in detail elsewhere in this book. The important point to 
note is that the pad can no longer be developed in isolation from the rest of 
the polishing system but must be developed and optimized with respect to 
the poUshing tool, the type of wafers being poUshed, and the poUshing fluid. 
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